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Abstract 

Let p be a fixed prime number, and N be a large integer. The 'Inverse Conjecture for 
the Gowers norm' states that if the "d-th Gowers norm" of a function / : F^ — > F is 
non-negligible, that is larger than a constant independent of N, then / can be non-trivially 
approximated by a degree d — 1 polynomial. The conjecture is known to hold for d = 2, 3 
and for any prime p. In this paper we show the conjecture to be false for p — 2 and for 
d = 4, by presenting an explicit function whose 4-th Gowers norm is non-negligible, but 
whose correlation any polynomial of degree 3 is exponentially small. 

Essentially the same result (with different correlation bounds) was independently ob- 
tained by Green and Tao [5] . Their analysis uses a modification of a Ramsey-type argument 
of Alon and Beigel pQ to show inapproximability of certain functions by low-degree polyno- 
mials. 

We observe that a combination of our results with the argument of Alon and Beigel 
implies the inverse conjecture to be false for any prime p, for d = p 2 . 



1 Introduction 

We consider multivariate functions over finite fields. The main question of interest here would 
be whether these functions can be non-trivially approximated by a low-degree polynomial. 

Fix a prime number p. Let F = ¥ p be the finite field with p elements. Let £ = e p be the 
primitive p-th root of unity. Denote by e(x) the exponential function taking x G F to £ x £ C. 
For two functions f,g: ¥ N -> F, let (f,g) := E x e(f(x) - g(x)). 

Definition 1.1: A function / is non-trivially approximable by a degree-d polynomial if 

l(/,2>l>€ 

for some polynomial g of degree at most d in F[cci ...xjv]- I 
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More precisely, in this definition we are looking at a sequence Jn of functions and of ap- 
proximating low-degree polynomials gjy in N variables, and let N grow to infinity. In this 
paper, the remaining parameters, that is the field size p, the degree d and the offset e are fixed, 
independent of N. 

A counting argument shows that a generic function can not be approximated by a polyno- 
mial of low degree. The problems of showing a specific given function to have no non-trivial 
approximation and of constructing an explicit non-approximable function have been exten- 
sively investigated, since solutions to these problems have many applications in complexity (cf. 
discussion and references in [H EJ [2] ) . 

This paper studies a technical tool that measures distance from low-degree polynomials. 
This is the Gowers norm, introduced in [3]. For a function / : F^ — > F and a vector y E F n , 
we take f y to be the directional derivative of / in direction y by setting 

fy( x ) = f(x + y) - f(x) 
For a fc-tuple of vectors yi-.yk we take the iterated derivative in these directions to be 

fyi—Vk = \fvi~Vk-l)y k 

It is easy to see that this definition does not depend on the ordering of yi—yk- 
The k-th Gowers "norm" of / is 

(^x,y 1 ...y k [ e (fyi...y k ( x ))]) 

More accurately, as shown in [3] , this is indeed a norm of the associated complex- valued function 
e(f) (for k > 2). 

It is easy to see that is 1 iff / is a polynomial of degree at most d. This is just 

another way of saying that all order- (d+ 1) iterative derivatives of / are zero if and only if / is 
a polynomial of degree at most d. It is also possible to see [I] that | (/, g) \ > e for g of degree at 
most d, implies H/Hf/d+i > e. That is to say, if / is non-trivially close to a degree-ci polynomial, 
this can be detectable via an appropriate Gowers norm. 

This discussion naturally leads to the inverse conjecture [U0E], that is if (d+ l)-th Gowers 
norm of / is non-trivial, then / is non-trivially approximable by a degree-d polynomial. This 
conjecture is easily seen to hold for d = 1 and has been proved also for d = 2 [3J [7J. It is of 
interest to prove this conjecture for higher values of d. 

In this paper we show this conjecture, which we will refer to as the 'Inverse Conjecture for 
the Gowers norm', or, informally, as ICGN, to be false. Let S n be the elementary symmetric 
polynomial of degree n in N variables, that is 

Sn{x) = ^ Tl Xi 

SC[N], \S\=ni£S 

We prove two claims about symmetric polynomials. Note that here and below a constant is 
absolute if it does not depend on N. 

First, we show Gowers norms of some symmetric polynomials to be non-trivial. 
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Theorem 1.2: There is an absolute positive constant e such that for any prime p 



S2p\\UP+ 2 > e i 



Here S2 P is viewed as a function over ¥ = F. 



Two versions of this result will be useful later. 



• A special case p = 2. 



5*4 1 1 jj4, > e 



(1) 



• An easy generalization: for any n > 2p, 



• n \\jJn-p + 2 > 6 



(2) 



In the second claim we show a specific symmetric polynomial to have no non-trivial approx- 
imation by polynomials of lower degree. 

Theorem 1.3: Let p = 2. For any polynomial g of degree 3 holds 



We conjecture the second claim of the theorem to be true for any prime number p, replacing 
3 with p + 1 and 4 with 2p. 

The combination of ([!]) and ([3]) shows ICGN to be false for p = 2 and d = 4. 
1.1 Related work 

Our results have a large overlap with a recent work of Green and Tao [5]. 

The paper of Green and Tao has two parts. In the first part ICGN is shown to be true when 
/ is itself a polynomial of degree less than p and d < p. In the second part, the conjecture 
is shown to be false in general. In particular the symmetric polynomial S4 is shown to be a 
counterexample for p = 2 and d = 4. 

To proof of non-approximability of S4 by lower-degree polynomials in [5] uses a modification 
of a Ramsey- type argument due to Alon and Beigel [T] . Very briefly, this argument shows that 
if a function over F2 has a non-trivial correlation with a multilinear polynomial of degree d, then 
its restriction to a subcube of smaller dimension has a non-trivial correlation with a symmetric 
polynomial of degree d. The problem of inapproximability by symmetric polynomials turns out 
to be easier to analyze. 

This argument gives a somewhat weaker bounds for non-inapproximability of S4, in that it 
shows (S^,g) < log _c (A r ) for any degree-3 polynomial g and for an absolute constant c > 0. 

On the other hand, this argument is more robust than our inapproximability argument. We 
observe below that it can be readily extended to the case of general prime p and, combined 
with show ICGN to be false for all p. 



(S4,g) I < exp{-<x/V} 



(3) 
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1.2 The case of a general prime field 

We briefly observe here that a minor adaptation of the Alon-Beigel argument, together with 
([2]), show the symmetric polynomial S p 2 to have a non-negligible (p 2 )-nd Gowers norm over 
F p and to have no good approximation by lower-degree polynomials. In that, S p 2 provides a 
counterexample to ICGN for any prime p. 

Indeed, by monotonicity of the Gowers norms ([4]), and since p > 2, a direct implication of 
© gives 

1 1 1 [ jj*pZ ^ c 

On the other hand, let g be a polynomial of degree less than p 2 in N variables such that 
(S p 2,g) > e. Note that the Alon-Beigel argument (as given in [I] and in [5]) does not seem 
to be immediately applicable in this case, since g does not have to be multilinear. A way 
around this obstacle, is to observe, via an averaging argument, that there is a copy of an N'- 
dimensional boolean cube {0, 1}^ , such that restrictions S' and g' of S p 2 and of g on this 
subcube satisfy {S',g') > e', and N',e' depend linearly on N,e'. Without loss of generality 
assume the coordinates of the boolean cube to be {1...N'} and consider the functions S',g' as 
functions in variables x\, ...,xj\f (with some fixed assignment of values to variables Xi, i > N'). 

Now, S' = Y^=o a iSi i s a symmetric polynomial of degree p 2 over , with aj = 1, and g' 
is a polynomial of a degree smaller than p 2 . Our gain is in that now g' can be replaced by a 
multilinear polynomial coinciding with g' on the boolean cube, and hence having a non-trivial 
correlation with S' on the boolean cube. 

Now, the Alon-Beigel argument can be applied to show that the symmetric polynomial S p 2 
has a non-trivial correlation with a symmetric polynomial h of a smaller degree over the boolean 
cube {0, 1}^ viewed as a subset of ¥ N . This, however, couldn't be true due to a theorem of 
Lucas, which implies that for a boolean vector x with Hamming weight w = Y2i=i x ii the value 
S p 2 (x) depends only on the 3-rd digit in the representation of w in base p, while the value of h 
depends only on the first 2 digits. 

This completes the argument. We conclude with an observation that this argument directly 
extends to S p k for any k > 1. 

Here is a brief overview of the rest of the paper. Section [2] defines relevant notions and 
contains proofs of several technical claims. Theorem II. 21 is proved in Section [3l Theorem 11.31 is 
proved in Section HI 

2 Some useful notions and claims 

2.1 Some multilinear polynomials and their properties 

In this sub-section we introduce and discuss certain polynomials over the finite field F. These 
polynomials can be conveniently viewed as multi-linear functions on matrices whose entries are 
elements of F, or formal variables with values in the field. A basic object we consider is a 
rectangular nx N matrix, N > n. A matrix M with rows r\...r n will be denoted by M[ri...r n ]. 
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Sometimes there will be repeated rows. In such a case we consider a partition A = (Ai...A&) of 
[n], that is Aj are (possibly empty) subsets of [n], whose disjoint union is [n]. We denote by 
M\[ri...rk\ the matrix whose rows in positions indexed by elements of Aj equal r^. Note that 
the partition A is ordered, in that the ordering of the sets \ is relevant. We use the notation 
{Ai...Afc} for an unordered partition. 

First, we introduce the "symmetric" function S. We define S{M) to be the sum of all the 
permanental minors of M, that is 

S{M) := J2 Per ( M c0 > 

CC[N],\C\=n 

where Mc is an n x n submatrix of M which is obtained by deleting all the columns of M 
except these with indices in C. 

Let A = (Ai...Afc) be a partition of [n], and set £i = |A,|. Clearly S (M\) depends only on 



Ah) (4) 



r k 



which denotes the matrix 



the cardinalities t- L of \. This leads to the notation M 
in which the row r\ appears £\ times, followed by £2 appearances of the row T2 and so on. In 
this notation, therefore 



S{M {x ^ Xk) [ ri ...r k ])=s(M 



S\M) _(|A fc |) 
1 — r k 



The second matrix function we consider is the "forward" function J 7 , with 

n 

F(M[ ri ...r n })= Yl 

CC\N\,\C\={n<j-2<...<j n }i=l 

Here ri(j) denote the j-th coordinate of the vector r. 
To connect the two notions, observe that 

S{M[r 1 ...r n ])=Y,HMK 1 ...r an ]) 

a 

where a runs over all permutations on n items. 

The last function we consider is a "hybrid" function TL which has some 'symmetric' and 
some 'forward' properties. Let A = (Ai...Afc) be an ordered partition of [n] with k terms. For 
another such partition 9 = (9i...6k) of [n] write 9 ~ A if = |Ai|,...,|0fc| = |A&|. We define 

k 

W(M A [n...r fc ])= Enil r *^) 

cc [at] , 1 c\={h <j 2 <-. <j n }0~\t=iiee t 

An alternative view of the functions S,J-~ and 7i might be helpful at this point. Consider the 
set of paths which are one-to-one functions from [n] to [N] . Let us call a path p monotone on a 
subset {i\ < %2 < ... < ie} of [n] if p(i\) < p{h) < ■■■ < p{h)- A path is (fully) monotone if it 
is monotone on [n\. Then, for a partition A = (Ai...Afc) of [n] and an n x N matrix M = M\, 

n 

S(M)=Y IKp« 
allp i=1 
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monotone p *=1 

n 

n{M)= II m *,p«) 

p monotone on Ai...Afc i=1 
Note that for the function Ti, similarly to the symmetric function S, holds 



H (M (A ,.. Afc) [n...r fe ]) =h(m [rf Al|) ...rf 



Observe also that if A = ({l}...{n}) then <S(M) = H{M). If A = ({[n]}) then .F(M) = H(M) 
and 5(M) = n\ ■ T(M) = n\ ■ H{M). For a general A = (A ...A fc ) 



S(M) = ^Ill A *l ! ) - n ( M ) 



(4) 



Note that this is an identity in F. In particular, if one of the terms Aj has cardinality at least 
p then S(M) = and ([4j) provides no information. 

To simplify the notation we will usually write S(ri...r n ) for S(M[ri...r n ]), J 7 \(ri...rf : ) for 
T (M\[ri...rk\) and so on. 

2.2 Directional derivatives of symmetric polynomials 

The functions we have defined are relevant to the discussion here for two reasons. First, the 
elementary symmetric polynomial S n (x) in N variables can be viewed as the forward function 
T applied to the matrix M[x...x], where M has n identical rows equal to x. In our notation, 

S n (x) = F{[ n ]}(x) 

Second, it is possible to write a directional derivative (S n ) yi yk of S n of any order as a combi- 
nation of values of T on explicitly defined matrices M whose rows are either the indeterminate 
x or the directions yi. 

The basic observation here is the following lemma which is straightforward from the defini- 
tion of directional derivative. 

Lemma 2.1: Let a polynomial P(x) in N variables be given by 

p ( x ) = ^{\ ...\ h ){x,yx...y k ) 

Then 

p z{x)= ^ F(AM\AM...\ k )( x i z iyi--yk) 
Ac\ 

In words, when we take the derivative of such a polynomial in direction z, we replace some of 
the rows which contained x with z. 
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As a corollary we have a following expression for higher order derivatives of a symmetric 
polynomial. 

Proposition 2.2: Let k < n, then 

n—k 

ts-Wi-E E w (*"»>,<,?■>. 

Proof: Iterating Lemma \2.1\ 

( Sn \i-y k ( x } = Y F\(x,yi...y k ) 

A=(Ao,Ai...A fe ) 

where the summation is over partitions A such that Aj are not empty for i = l...k. Rearranging, 
this is 

n—k 

Y Y Y Fxfavi-Vk) = 

m=0 €i..-4>l, E^i=™-m A: |A |=m,|Ai|=£ 1 ...|A fc |=4 
n—k 

E E n{^\yf\..y^) 

I 

We can give explicit expressions for the coefficients of (S n ) yi yk (x). Fix m indices j\ < 
]2 < ... < jm for < m < n — k, and let a be the coefficient of Xj 1 ■ ■ ■ Xj m in (S n ) yi yk - 

Corollary 2.3: 

«i-4>l, Ei4=n-m 
• Ifk + m + p>n+\ then 

Y (iM •^•••->( y f i) ..4 4) ) 

Here, for a subset of indices T C [iV], 7i T (M) returns the value of the matrix function Ti. applied 
to the n x (iV — |T|) matrix obtained from M by deleting columns in T . The function S T {M) 
is defined similarly. 

Proof: The first claim is immediate from Proposition 12.21 The second claim follows from the 
first claim, from (JH), and from the simple observation that \ik + m + p>n+l then 1% <p for 
i = \...k in the above summation, which means £{\ is invertible in F p . I 
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Example 2.4: The following "toy" example will be relevant for the case of the binary field. It 
is sufficiently simple to illustrate what's going on behind the cumbersome formulas. Consider 
P = {S^jy z . Then P is a quadratic polynomial and for 1 < i < j < N 

coei x{t)xU) (P) = Yl y( k >^ = S{hj} (f ' z ) 

k^l, k,lg{i,j} 

I 

Continuing with the same example, note that it convenient to express the symmetric function 
S (y, z) via inner products of vectors y, z, 1, where 1 is the all-1 vector of length N. 

S(y, z) = J2 y(k)z(l) = (y, 1) • (z, 1} - (yz, 1) 
k+i 

Here we take yz to be the vector whose coordinates are point-wise inner products of the coor- 
dinates of y and z, that is (yz)(i) = y(i)z(i). Of course, {yz, 1) is the same as (y,z). 

Similarly, we can express the 'incomplete' symmetric function S^'^(y, z) via the complete 
symmetric function S(y,z) minus forbidden terms, as follows 

S^\y, z) = S(y, z) - (z(i) + z(jj) (y, 1) - (y(i) + y(j)) (z, 1} + (y(i)z(j) + y(j)z(i)) 

Note the "inclusion-exclusion" structure in the two expressions above. (To make it even clearer 
we use "+" and "-" notation, though in the binary field both are, of course, the same.) This 
structure becomes more evident as we pass to our next order of business, which is expressing, 
for general n and k, the coefficients of (S n ) yi yk via inner products of vectors y\...yk, 1. 

2.3 Inclusion-Exclusion formulas for symmetric functions 

Some notation: Given m vectors y\--.y m and a subset r C [m], let y T to be vector whose 
coordinates are point-wise products of the corresponding coordinates of yj, i£r. Let S (y[r]) 
for the value of the function S on a matrix with |r| rows j/j, i G r. Let (y T ) be the polynomial 

(Vt,1) = EjLi UierViU)- 

We start with an auxiliary lemma expressing the incomplete symmetric function (r\...r n ) 
as a polynomial in the fc-th coordinate of the vectors and in complete symmetric functions 
applied to sub- matrices of M[r\...r n \. 

Lemma 2.5: 

5«(r 1 ...r n )= ^(-l)M(|r|)!T T (fc)-5(r 

rC[n] 

From now on we assume r$ to be the all-1 vector, and S (r[0]) to equal 1. 
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Proof: The proof is by induction on n. For n = 1 both sides equal X^Li r i(i) ~~ r i{k)- 
For n > 1, observe that 

n 

<S« ( n ...r n ) = 5 (n...r n ) - X)ri(fc) • <S W (r [[n] \ {*}]) 

i=i 

and the claim is easily verified using the induction hypothesis. I 

Now we can state two main claims of this section. The first expresses the complete symmetric 
function <S {r\...r n ) via inner products (ry). 

Proposition 2.6: 

m 

s (n...r„) = E II ((-i) |AtK (M - 1)! • (»%>) 

A={Ai...A m }t=l 

In £/us summation A = {Ai...A m } runs over a// unordered partitions of [n] with non-empty Aj. 



Proof: Again, the proof is by induction on n. For n = 1 both sides equal X]j'=i r i(i)- For 
n > 1 we have 

N 

S{ n ...r n ) =Y J r n {h)-SW{r 1 ...r n - X ) 

k=l 

Using Lemma [2751 and the induction hypothesis, 

TV 

S(n...r n ) = • E (-l) |T| (H)!-fr(k)'5(r[[n-l]\T]) = 

fc=l rC[n-l] 

E (-l) |T| (|r|)!-<r rUN >-5(r[[n-l]\r 
rC[n-l] 

Consider the summand corresponding to r = [n— 1]. Recall the boundary assumption S (r[0]) = 
1. Hence this summand is (— l) n_1 (n — 1)! • (n n i). This summand therefore corresponds to the 
partition A = {[n]} in the claim of the proposition. 

For r a proper subset of [n — 1], we use the induction hypothesis to obtain 



S(n...r„)= E (-l) |r| (|r|)!-<r rUN >- E II ((-l) 1 * 1-1 " 1)! " + 
rC[n-l] 0={e 1 ...0,}t=l 

(-l) n ~Hn - 1)1 ■ (r [n] ) 

Here # runs over all the unordered partitions of [n — 1]\t with non-empty 0j. Observe that each 
pair (t,9) leads to a unique partition A = {A1...A/+1} = {9\...Qi,t\J [n]} of [n]. Rearranging the 
terms, the last summation can be written as 



A=(Ai...A m )t=l 
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completing the proof of the proposition. | 

The second claim expresses the incomplete symmetric function S^i—ik} (rj...r n ) as a poly- 
nomial in the missing coordinates j\ . . .jk of the vectors r j and in complete symmetric functions 
applied to sub-matrices of M[ri...r n ]. Note that Lemma [2.5l is a special case k = 1 of this claim. 

Proposition 2.7: 

k 

S^-M (n...r n ) = Il((-l) N (N) ! -^(7t)) -5(r 

r=(n...T fc ) *=1 

Here the summation is on all ordered set systems r such that the terms r t are disjoint subsets 
of [n] . The terms may also be empty. 

Proof: The proof is by induction on k and n. The case k = 1 is treated in Lemma 12.51 

Consider the case n = 1. On one hand S^—ik} (rj) = J2f=i r i(i) ~~ J2t=i r i (•?*)• We claim 
that this value can be also represented as 

k 

£ H{(-l) lTtl (\n\y--r Tt (3t))-S(r 

r=(ri...T k ) t=l 

Here T{ are disjoint subsets of [1]. Observe that there are k + 1 summands in this expression, 
corresponding to different set systems r. Let denote the set system with k empty terms, and 
let 7"W, for t = l...k denote the set system with Tt = {1} and all the remaining terms are empty. 
The summand corresponding to r(°) is 

& ( r i) = Ylf=i r iU)- The summand corresponding to 
7~W is (— r\ (jt)) ■ S (tq) = —r\ (jt), and we are done in this case. 

For k,n > 1, we have 

n 

S {n-h} (r x ...r n ) = S' 1 > (n...r„) - £r; • 5^-»> (r 

i=i 

By the induction hypothesis, this is 

fc-l 

0=(0i-A-i) * =1 

n fe 

E II ((-l^'CI^Dl-r^oCjt,)) -s(r 

1=1 fil / (i) U=l 

Here the summation is on all ordered set systems 9 such that the terms 6t are disjoint subsets 
of [n] and on ordered set systems i = l...n such that the terms /Lt« are disjoint subsets of 

H \ {»}• 



n 



\U t 71 



[i]\um 




\U t t 



v? 



\ U^ t (i) \ {<} 
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Given a set system 9 = we define a set system r = (ri...Tfe) by setting r t = 0t, 

t = l...k — 1 and = 0. Given a set system //W = we define a set system 

r = (Ti...Tfc) by setting r u = /i® , u = l...k — 1 and t& = /zj^ U {i}. In both cases we have 
obtained a set system of the type we want, that is an ordered family of k disjoint subsets 
of [n]. Moreover, each such system with empty k-th term is obtained exactly once, from the 
corresponding ^-system, and each system with non-empty fc-th term r& is obtained exactly |rfc| 
times, from systems y^' with i € Tfc. Rearranging the terms and the signs, the last expression 
is precisely 

k 

E II ((- 1 ) |rtl (l^l) ! • r n (it)) • S (r[[n] \ U t r t ] ) , 
T=(n...T fe ) i=l 

completing the proof. I 



2.4 Some properties of Gowers' norms 

The main result in this subsection shows that if a function from F^ to F is fixed on a subset of 
¥ N defined by low-degree polynomial constraints, then it has a non-trivial Gowers norm of an 
appropriate order. 

Recall that for a vector x G F , x % stands for a vector in ¥ N whose coordinates are i-th 
powers of the coordinates of x. 

Proposition 2.8: Let K be an absolute constant. Let yij, i = l,..p— 1, j = 1...K, be K(p— 1) 
vectors in ¥ N . Let M be a subset of¥ N defined by the constraints (x l ,yi.j) = for all i,j. 

Let f be a function from ¥ N to F. Assume that f is fixed on M. Then 

\\f\\up > (^Pj 2 =• Pr 2 {M] 

Proof: Let f\M = c 0- 

Consider a subspace V of polynomials of degree at most p — 1 in ¥[xi...xn] spanned by the 
polynomials for all We will first find a polynomial g E V such that | (/,<?) | > 

Pr{M}. This, combined with a lemma from [3], will imply the claim of the proposition. 

Let b = (bij), i = l...p — 1, j = 1...K, be a matrix with entries in F. Let c G F. Set 

/x(b,c) = Prjx : f(x) = c A (x\yij) = b itj for all 

Note that, by assumption, for a zero matrix b holds \x (b, cq) = Pr{M}. In other words, 
/i(b, c) = and for b = any c ^ cq. 

Now, for any g(x) = Ylij a i,j \ x% i Vi,j) m ^ holds 

</, 9 ) = Ee(/ - g) = £ /i(b, c) • e (c - (a, b» 
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where a = (aij)^ • and (a, b) = £\ j a i,jbij- Averaging over V, we have 

E g ev </,»> = i^EE ^ b ' c ) " e ( c ~ < a > b » = fjH E <"( b ' c ) ' e ( c ) E e (- < a ' b » = 

a b,c b,c a 

E MO, c) • e(c) = y. (0, c ) • e (c ) = Pr{M} • e (c ) 

c 

This means, there is g £ V with | (/, g) \ > Pr{M}. We conclude the proof of the proposition 
by quoting a lemma from [4], which states that | (/, g) \ > e implies ||/||c/p > e. I 

2.5 Asymptotic uniformity and independence of some random variables 

In this subsection we deal with another property of multiviarite polynomials. Let n be fixed 
integer and let N be an integer parameter growing to infinity. Let r\...r n be n vectors in F^. 
Let k = (ki...k n ) be a non-zero sequence of integers < h < p. For each such sequence define 
a polynomial X K (n, ...,r n ) = Y$Li U7=i r i'U)- 

Now,let r\...T n be chosen uniformly and independently from F . We claim that for a large 
./V the random variables X K (n, ...,r n ) are nearly independent and uniformly distributed over 
F. Let X = (X K ) K , and let K = p n . 

Proposition 2.9: Let U be the uniform distribution on ¥ K . Let P be distribution of X on ¥ K . 
Let || • || denote the statistical (l\) distance between distributions. 

Then there is a constant c > depending on n,p but not on N such that 

\\P-U\\ < exp{-ciV} 

Proof: We start from a simple observation that Fourier transform of a uniform distribution is 
the delta function at 0. In addition, the two following statements are equivalent up to constants: 
'a distribution is exponentially close to uniform' and 'all non-zero Fourier coefficients of the 
distribution are exponentially close to zero'. Accordingly, we will show that all the non-zero 
Fourier coefficients of P tend exponentially fast in N to zero. 

Consider a character x(u) = ^ v ' a \ corresponding to a non-zero vector a = {a K ) K G F^. 
(Recall that £ = e 2m / p is the p-th primitive root of unity.) Then, normalizing appropriately, 

Hx) = E p (yMy) = E Pr ^ x = v)-C Ek anVK = E ^ Ek " kXk 

y y 

Let P a denote the distribution of the random variable X a = 'Y1 ik o-kX k . Then we have shown 
P(x) = PaiX)- We will show the non-zero Fourier coefficients of P a to be exponentially small, 
completing the proof of the proposition. 

We have 

N n 

X a (r h ...,r n ) =E a « P «( r i'-' r «) = E E ° K 11^^ 

K j=l K=(k 1 ...k n ) i=l 
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Let Xi be elements of the field F. Consider an n-variate polynomial 



n 



Q(x 1 ...x n ) = ^2 



a K Yl x i 



1=1 



Since not all of the coefficients a K are zero, and since all k are non-zero sequences, Q is a 
multi-variate polynomial of degree at least 1 in F[xi...x n ], and therefore attains at least two 
values with probability bounded away from zero. Now, X a = J2f=i Q ( r i(i)--- r n(j)) is a sum of 
N independent copies of Q. Let \i denote the distribution of Q on F. Then the distribution P a 
of X a is fi* N , the iV-wise convolution of fj, with itself. Since p is prime, /I(0) = 1, and < 1 
everywhere else. Therefore, P a = (j2) N tends to the delta function at exponentially fast in N, 
completing the proof. I 

2.6 Estimates on the number of common zeroes of some families of polyno- 
mials 

The main claim of this subsection is the following proposition. 

Proposition 2.10: Let M be the ring of ¥ -valued functions on ¥ N , that is M = ¥{xi...xn]/I , 
where I is the ideal (x\ — x, ...,x p N — x). Let fi-.-fx be polynomials in M. Let S be the set of 
common zeroes of fi-.-fx, that is 



where J is the ideal generated by {f%}, and dim{M/J) denotes the dimension of dim(M/J), 
viewed as a vector space over ¥. 

Proof: For each u G S, let q u € M be defined by q u {u) = 1 and q u {v) = for all v ^ u. We 
will show that the family {q u + J} Mg 5 is linearly independent in Mj J. This will immediately 
imply the claim of the proposition. 

Consider a linear combination q = ^2 ueS ^ u Qu such that q € J. Let v G S. We compute q(u) 
in two ways. First, since q G J, we have q(v) = 0. On the other hand, q{v) = YlueS ^uQu( v ) = 
X v . This shows A„ = for all d£S, completing the proof. | 

In some cases, the dimension of M/J is easy to estimate. 

Lemma 2.11: Let p = 2, let K = {^), and let {//} be indexed by k-subsets I of [N]. Assume 
that for any such subset L holds 




Then 



S\ < dim{M/J) 




(5) 



13 



Then, 




Proof: We will construct a generating subset of the vector space Mj J of cardinality at most 
Sji=o Cj)- We- start from a trivial generating set {m + J}, where m runs through all the 
2^ multi-linear monomials in N variables. Now, in the factor space M/J, we can replace 
any product of k variables, Hie/ 2 -*' by a polynomial of degree smaller than k. Iterating this 
procedure, we arrive to a generating set spanned by {s+J}, where s now runs through ^j=o Cj) 
monomials of degree at most k — 1. I 



3 Proof of Theorem Q 

We need to show that 

ll^pllr/p+z > e 

for an absolute constant e. 

We remark that ([2]) can be shown exactly in the same way, replacing 2p with n and p + 2 
with n — p + 2 throughout. 

Recall ([!]) that ||,f||[/p+2 = E^J l|/y,z|lup- Since the Gowers' norms are nonnegative, it 
will suffice to show that H/^Ht/p is non-negligible for a non-negligible fraction of directions y, z. 

Let 

A = {(y, z) : (y a , z b \ = for all < a, b < p} 
By Proposition [231 for uniformly and independently chosen directions y, z, and for a sufficiently 

2 

large N, the probability of A is very close to p~ p . Therefore, A is a non-negligible event. We 
will now show that for any (y, z) € A holds H/y^Ht/p > e'(y,z), for an appropriate function e'. 

Fix (y, z) in A. Let / = (S 2p ) y ^ z . Let 

M = M(y, z) = |x : (x\y a z h ^ = for all 1 < i < p - 1, < a, b < p\ 

We will show that / is fixed on M. Assuming this, by Proposition 12.81 we have ||/y iZ ||c/p > 
Pr 2 {M}, and therefore 

||/||^ + + 2 2 = KyMAu> > Pr{A} ■E (ytZ)eA Pr 2P+1 {M(y,z)} > 

Pr{A}.E^ z)eA Pr{M(y,z)} > (Pr{A} ■ E MeA Pr{M(y, z)}f +1 = 

Pr 2P+1 {x : (x i y a z b ^ = for all < a, b, i < p - l} > Q (V p3 - 2P+1 ) 

The last inequality follows from Proposition 12.91 since random variables (x l y a z b ^ are asymp- 
totically uniform and independent. 

It remains to prove the following fact. 
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Lemma 3.1: Let x, z be three vectors in ¥ N satisfying (x l y a z b ^ = for all < a, 6, z < p—1. 
Then 

(S 2p ) ytZ (z)=w(y«,*«) 

Proof: By Proposition 12.21 

2p-2 

(s 2p ),,» = E E w(x (m, ,y (a, ,* w ) 

m=0 a,b>l, a+6=2p— m 

We claim that all of the summands on the right, except (possibly) 7i (y(p), z W) are 0. 

There are two possible cases to consider. The easier case is when a,b,m < p. In such a 
case, by @, H (x^ m \y^, z®) is proportional to S (x^ m \y^ a \ z^) . By Proposition the 
symmetric function S (x( m \ y^ a \ z^) is a polynomial in (x % y a z b ^, which vanishes when all of 
these inner products are 0. 

In the second case, one of the indices a, b, m is at least p. Note, that there could be at 
most one such index (barring the case a = b = p). We may assume this index is m. We claim 
that in this case 7i (xf™',|/",r") can be written as a linear combination of hybrid functions 
Ji [x^\r\, ...,r m -A, where t < m and the vectors rj are of the form x a y /3 z" / . Note that this will 
suffice to prove the lemma, since iterating this step will express Ti (x( m \ y( a \ zW) as a linear 
combination of symmetric functions in r^, and these functions vanish. 

Consider TL (x^ m \y (a \z^). For notational convenience, let wi...w a +b stand for the vectors 
y...y,z...z (y taken a times and z taken b times). Note that both a and b are smaller than p. 
Using Corollary 12.31 and Proposition 12.71 

n{x^\y^\z^) =(a\-b\y 1 - E x h x l2 ---x tm S^-^{y^,z^) = 

H<«2<-<j m 
m 

(al-W)" 1 - E x h x l2 ---x im E n((- 1 ) |Tfl (l T *l) ! -^^))- 5 ( u; 

*l<«2<---<« m T=(ri...r m ) t=l 

Here the inner summation is on all ordered set systems r such that the terms Tt are disjoint 
subsets of [a + b] . The terms may also be empty. 

Let us attempt to simplify the double summation we obtained. First, we may disregard the 
constant term (a! • b\)~ . Next, observe that, as before, all symmetric functions of the form 
<S (w[T"]) vanish, unless T is empty, in which case they equal 1. Therefore, we may consider the 
double summation 

m 

e ^ i2 ---x lm - n((- i ) |Tt| (i r *i) ! -^^)) 

ii<i2<...<im r=(ri...T m ) t=l 

Here the inner summation is on all ordered partitions r of [a + b]. The terms Tt may also be 
empty. Changing the order of summation, and ignoring the constant term ( — l) a+b , we get 

m m / in \ 

E II(l r *l) ! ' E ]J( X ■ w n) (k) = E [n(\n\y-yF(xw T1 ,xw T2 ,...,xw Tm ) 

T = (ri...T m ) t=l il<i2<...<im t=l r=(Tl...T m ) \t=l / 



[a + b]\ U t r t 
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Consider the last expression. Let us use some more notation. For an ordered partition r = 
(ri...r m ), let n = n(r) be the number of empty terms. Let {ri...r m } denote the unordered 
version of this partition, where the first n(r) terms are taken, by agreement, to be the empty 
ones. Then we can rewrite this expression as 

E (f[(N) ! ) • H (x^ , xw Tn+1 xw Tm ) 

T={r 1 ...T m } \t=l / 

Now, clearly not all the terms in the partition are empty and, therefore, n{r) < m for all r, 
completing the proof of our last claim, of the lemma, and of the theorem. | 



4 Proof of Theorem 11.31 

Let p = 2. We will show there is an absolute constant a > such that for any polynomial g of 
degree at most 3 in N variables holds 

(5 4 , g) < exp{-aiV} 

A first step is to observe that there is a relation between the inner product of two functions 
and the average inner product of their derivatives. 

Lemma 4.1: For any two functions f and g holds 

(f,g) 4 < E y (f y ,g y ) 2 

Proof: This is an immediate corollary of a lemma in [7], but we give the elementary proof for 
completeness. By the Cauchy-Schwarz inequality, 

Ey (f y ,g y ) 2 > E 2 y (f y ,g y ) = E^(-l)/W+/(^)+9(-)+ff(-+?/) = E 4 (-l)/W+^) = (f,g) 4 

I 



Corollary 4.2: 



(f,g) 8 < ^y,z {fy,z,9y,z) 2 



We will show that for any polynomial g of degree at most 3 holds E VyZ (^S^) y z , g Vy z) < 
exp{— aN}. First, here is a brief overview of the argument. 

The point is that taking second derivatives makes life easier, since a second derivative of g 
is a linear function, and a second derivative of S4 is a quadratic. We therefore need to show 
that for the large majority of directions y, z, the quadratic function (£4)^ z has a small inner 
product with the linear function (— l) 9y - z . In this we will be helped by a theorem of Dixon 
giving a structural description of quadratic polynomials, which, in particular, characterizes the 
Fourier transform of functions of the type (—1)^, where Q is a quadratic. In fact, setting 
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Q = (Si) y z we will see that for many of the directions y,z the Fourier coefficients of (— 1)** 
will be exponentially small. For the remaining directions, these Fourier coefficients will be 
supported on an explicit easy to describe 3-dimensional afhne subspace depending on y, z. We 
will then argue that for any fixed polynomial g of lower degree, the support of the character 
(— \) g y> z lies in this afiine subspace with exponentially small probability over y,z. 

We proceed with computing the second derivative Q = (S4) . 

4.1 Second derivatives of £4 

Write Q(x) = Y, i<:j qi,jx(i)x(j) + + c - 

By Proposition 12.21 or by Example 12.41 

q itj = S(y, z) - {y, 1} • (z(i) + + (z, 1) • (y(i) + y(jfj + (y{i)z(j) + y(j)z(i)^j 

At this point we invoke (a corollary of) a theorem of Dixon [6]: 

Theorem 4.3: Let Q(x) = Y2i<j H,j x {'' , ) x U) + Y2i^i x {^) + c be a quadratic polynomial over 
F2. Consider the symmetric matrix with zeros on the diagonal and off- diagonal entries given 
by Sij = Sj t i = qij. Let the rank of B = 2h (it is always even). Then the function (— 1)** has 
2 2h non-zero Fourier coefficients of absolute value 2~ h . Moreover, all these coefficients lie in 
an 2h- dimensional affine subspace ofFl}. 

Consider the matrix B in our case. Some notation: let J be the matrix with on the 
diagonal and 1 off the diagonal. Let u ® v denote the outer product uv l . Then, 

B = S(y, z)-J+{y,l)-(z®l + l®z) + {z,l)-(y®l + l®y) + (y®z + z®y\ 

Since the rank of J is at least N — 1 and the rank of the remaining matrices is at most 2, 
the matrix B is almost of full rank if S(y,z) = 1. In this case, by Theorem 14.31 the Fourier 
coefficients of (—1)^ are exponentially small. 

We therefore may assume S(y, z) = 0. In this case the quadratic part of Q may be written 

as 

^2 IhM^U) = (V, !) • (x, 1) (x, z) + (z, 1} • (x, 1) (x, y) + ( (x, y) (x, z) + (x, yz) ) 

i<j 

Recall that yz denotes the pointwise product of vectors y and z. 

This implies the non-zero Fourier coefficients of Y2i<j QiJ x (}) x (j) ne m a 3-dimensional 
affine subspace of F^. The linear part of this subspace is spanned by the vectors y, z, 1 and it 
is shifted by a vector yz. 

Next, consider the linear part ^2i^(i)x(i) of Q. By Proposition 12.21 
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£ (y(k)y(l)z(j)+y(j)y(l)z(k)+y(j)y(k)z(l^ + (y(j>(A;)z(/)+y(A:)z(j>(0+y(0^(j>(^)) 

j<k<l^i 

This can be directly verified to be equal to 

(S{y, z) + S(z, z) + (z, 1} ) • y(z) + (<S(y, z) + S(y, y) + (y, 1} ) • z(i)+ 

(S{y, y) ■ (z, 1} + S(z, z) ■ (y, 1) + (y, z) ■ (y + z, 1} ) 

By assumption, S(y, z) = (y, 1} • (z, 1) + (y, z) = 0. Note that this also implies (y, z)-(y + z, 1) = 
0, implying 

£(i) = (S(z, z) + (z, 1) ) • y(i) + ((5(y, y) + (y, 1) ) • z(i) + («S(y, y) • (z, 1) + 5(z, z) • (y, 1) ) 
Consequently, the linear part of Q may be written as 

(S(z, z) + (z, 1) ) • (x, y) + ({S(y, y) + (y, 1} ) • (x, z) + (<S(y, y) • (z, 1} + 5(z, z) • (y, 1) ) • (x, 1) 

This means that the non-zero Fourier coefficients of the polynomial Q = J2i<j 9«,j x W x (i) + 
J2 i £(i)x(i) + c lie in the affine subspace AF y ^ z = yz + Span (y, z, 1). 

4.2 Second derivatives of a fixed polynomial of degree 3 

Let 

g(x) = o-i,3, kx{i)x{j)x{k) 

i<j<k 

be a polynomial of degree 3. For directions y,z G F^, consider the second derivative g VjZ = 
Yli v y,z{i)x{i) + c V;Z . We need to show that the probability of the vector v ViZ falling in the affine 
space AFy^ z = yz + Span (y, z, 1) is exponentially small. 

First, some notation. For 1 < i < N, let Gi be a symmetric N x N matrix over F with 
{Gi)- k = (Gi) k j = aij : k for all j ^ k. (Here we think about {i,j,k} as an unordered subset 
of [N].) The diagonal entries of Gi are set to 0. For future use note the important property 

( G i)j,k = ( G i)i,k = ( G k)i,j- 

These matrices are relevant because they describe the vector v y;Z . 

Lemma 4.4: 

v y ,z(i) = coef x(i) (g y , z (x)) = {y,Giz) 

• An alternative representation of v y>z will be more convenient for us. For z £ F^, let 
G{z) = Y^=iz{i)Gi. Then 

v y ,z = G(z) ■ y 
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Proof: For the first claim of the lemma, by linearity of the derivative, it suffices to consider 
the monomial g{x) = x(i)x(j)x(k). This case can be easily verified directly. 

For the second claim, note that 

N N N N N 

(G(z).y)(l) = Y J (G(z)) ktl y(k) = •!>(<) = (G,) M *(i) = <»,G,z> 

fc=l fc = l 1=1 fc=l 1 = 1 

I 

Consider the event {v V)Z & AF y z }. This means v y z = yz + u^, for some vector u y z € 
Span(y, z, 1). There are only 8 possible choices for u y>z . For convenience, let us assume, without 
loss of generality (as can be easily seen from the proof), that u yz = y + z + 1 is the most popular 
one. By the lemma, the event = yz + %,z} is the same as {G(z) ■ y = yz + Uy,z}- To 

simplify things some more, let Ai = Gi + e% ® e^, z = 1.. JV. That is, ^ = G» but for (-Aj)j i = 1. 

Let -A(z) = X^i ^(i)-^*- Note that -A(-z) • y = G(z) • y + yz. Hence {G{z) - y = yz + u y>z } is the 
same as {A(z) ■ y = u y>z = y + z + 1} 

We conclude the proof by a technical claim. 

Proposition 4.5: Let {^4i}, i = 1...N be a family of symmetric N x N matrices over F with 
Ai(k, k) = 8ij~. Then, for y, z uniformly at random and independently from ¥ N , 

Pr y , z {{A{z))-y = y + z + l} < 

The proof of the proposition is based on the claim that the rank of a matrix A(z) is typically 
large. 

Lemma 4.6: Let matrices {Ai} be as in the proposition. Let C be any fixed symmetric N x N 
matrix. Then 

. fe-i 

Pr z {rank(A(z) + C)<fc-l}<^-^ 

i=0 

Proof: Consider a family of (, ) polynomials /j on ¥ N . These polynomials are indexed by 
fc-subsets of [N]. For a fc-subset I, let fi(z) be the determinant of the I x I minor of A{z) + C. 
Clearly, rank of A(z) + C is smaller than k if and only if z is a joint zero of {//}. 

We now claim that the coefficient of Y\ ieI Z{ in fiiz) is 1. If this is true, deg(// — Yl ieI Zi) < 
k — 1, and the claim of the lemma will follow from Lemma 14.61 

Let B(z) = A(z) + C. Since we are working in characteristic two, the symmetry of B(z) 
implies that 

N 

det B{z)= U B M^ Z ) = 

<t(zSn: <T=a~ 1 2=1 

n 

^ ] J (Zi + Ci t i) ■ Y\ B iv{i) ( Z ) = II Z i + loWer ° rder terms - 

<t€Sjv ; cr=<J~ 1 {i:cr(i)=i} {i:i<er(i)} i£j 
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In the second equality we use the identity B?,.Jz) = B icr ^(z) in F. | 
Let I denote the identity N x N matrix. 

Let p(z) = Pr y ^A{z) ■ y = y + z + l}. Clearly p(z) < 2- rank ( A W+ / ). By Lemma Ml 

Pr y , z { (A(z)) .y = y + z + l}= E zPz < E Z 2-^ A ^ < * £ (f) !T* = 
This concludes the proof of the proposition, and of Theorem 11,31 
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